AITopics | Anau

Collaborating Authors

Anau

159f7fe5b51ecd663b85337e8e28ce65-Paper-Conference.pdf

Neural Information Processing Systems | Feb-7-2026, 14:56:34 GMT

inference, nre, posterior, (15 more...)

Neural Information Processing Systems

Country:

North America > United States (0.14)
Europe > Netherlands > North Holland > Amsterdam (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Asia > Turkmenistan > Ahal Region > Anau (0.04)

Genre: Research Report (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.95)
Information Technology > Artificial Intelligence > Natural Language (0.68)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.47)

Nested Slice Sampling: Vectorized Nested Sampling for GPU-Accelerated Inference

Yallup, David, Kroupa, Namu, Handley, Will

arXiv.org Machine Learning | Feb-2-2026

Model comparison and calibrated uncertainty quantification often require integrating over parameters, but scalable inference can be challenging for complex, multimodal targets. Nested Sampling is a robust alternative to standard MCMC, yet its typically sequential structure and hard constraints make efficient accelerator implementations difficult. This paper introduces Nested Slice Sampling (NSS), a GPU-friendly, vectorized formulation of Nested Sampling that uses Hit-and-Run Slice Sampling for constrained updates. A tuning analysis yields a simple near-optimal rule for setting the slice width, improving high-dimensional behavior and making per-step compute more predictable for parallel execution. Experiments on challenging synthetic targets, high-dimensional Bayesian inference, and Gaussian process hyperparameter marginalization show that NSS maintains accurate evidence estimates and high-quality posterior samples, and is particularly robust on difficult multimodal problems where current state-of-the-art methods such as tempered SMC baselines can struggle. An open-source implementation is released to facilitate adoption and reproducibility.
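
As a rough illustration of the constrained update the abstract refers to, the following is a minimal single-chain NumPy sketch of a hit-and-run slice step under a hard likelihood constraint. It is not the authors' vectorized GPU implementation: the function name `hit_and_run_slice_step`, the uniform prior on the unit hypercube, and the stepping-out and shrinkage details are all assumptions made for this example, and the slice `width` is left as a free parameter rather than set by the paper's tuning rule.

```python
import numpy as np

def hit_and_run_slice_step(x, loglike, logl_min, width, rng, max_shrink=100):
    """One constrained move inside {loglike > logl_min} within the unit cube.

    Assumes a uniform prior on [0, 1]^d (an assumption of this sketch), so the
    target along any line is uniform on the feasible segment.
    """
    # Hit-and-run: draw a random unit direction.
    d = rng.normal(size=x.shape)
    d /= np.linalg.norm(d)

    def inside(t):
        y = x + t * d
        in_cube = np.all((y >= 0.0) & (y <= 1.0))
        return bool(in_cube and loglike(y) > logl_min)

    # Place an initial bracket of length `width` at random around t = 0.
    left = -width * rng.uniform()
    right = left + width

    # Stepping out: grow the bracket until both ends violate the constraint.
    while inside(left):
        left -= width
    while inside(right):
        right += width

    # Shrinkage: propose uniformly in the bracket, shrink towards t = 0 on rejection.
    for _ in range(max_shrink):
        t = rng.uniform(left, right)
        if inside(t):
            return x + t * d
        if t < 0.0:
            left = t
        else:
            right = t
    return x  # fall back to the current point if no proposal was accepted

# Usage: explore a small ball around the centre of the unit cube.
rng = np.random.default_rng(0)
loglike = lambda y: -float(np.sum((y - 0.5) ** 2))
x = np.full(3, 0.5)
for _ in range(200):
    x = hit_and_run_slice_step(x, loglike, logl_min=-0.04, width=0.2, rng=rng)
```

Because each step needs a data-dependent number of likelihood calls, a fixed, well-chosen width is what would make the per-step cost predictable when many such chains run in parallel.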

artificial intelligence, machine learning, sampling, (19 more...)

arXiv.org Machine Learning

2601.23252

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.14)
Asia > Turkmenistan > Ahal Region > Anau (0.04)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
(2 more...)

Genre: Research Report > New Finding (0.67)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.88)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.46)

Blind Strong Gravitational Lensing Inversion: Joint Inference of Source and Lens Mass with Score-Based Models

Barco, Gabriel Missael, Legin, Ronan, Stone, Connor, Hezaveh, Yashar, Perreault-Levasseur, Laurence

arXiv.org Artificial Intelligence | Nov-10-2025

Score-based models can serve as expressive, data-driven priors for scientific inverse problems. In strong gravitational lensing, they enable posterior inference of a background galaxy from its distorted, multiply-imaged observation. Previous work, however, assumes that the lens mass distribution (and thus the forward operator) is known. We relax this assumption by jointly inferring the source and a parametric lens-mass profile, using a sampler based on GibbsDDRM but operating in continuous time. The resulting reconstructions yield residuals consistent with the observational noise, and the marginal posteriors of the lens parameters recover true values without systematic bias. To our knowledge, this is the first successful demonstration of joint source-and-lens inference with a score-based prior.

artificial intelligence, machine learning, posterior, (14 more...)

arXiv.org Artificial Intelligence

2511.04792

Country:

North America > United States > California > San Diego County > San Diego (0.04)
North America > Canada > Quebec > Montreal (0.04)
North America > Canada > British Columbia > Vancouver (0.04)
(2 more...)

Genre: Research Report (0.65)

Technology:

Information Technology > Artificial Intelligence > Vision (0.68)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.68)
Information Technology > Sensing and Signal Processing > Image Processing (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.46)

A Probabilistic U-Net Approach to Downscaling Climate Simulations

Alipourhajiagha, Maryam, Lemaire, Pierre-Louis, Diouane, Youssef, Carreau, Julie

arXiv.org Artificial Intelligence | Nov-6-2025

Climate models are limited by heavy computational costs, often producing outputs at coarse spatial resolutions, while many climate change impact studies require finer scales. Statistical downscaling bridges this gap, and we adapt the probabilistic U-Net for this task, combining a deterministic U-Net backbone with a variational latent space to capture aleatoric uncertainty. We evaluate four training objectives (afCRPS, and WMSE-MS-SSIM under three settings) for downscaling precipitation and temperature from $16\times$ coarser resolution. Our main finding is that WMSE-MS-SSIM performs well for extremes under certain settings, whereas afCRPS better captures spatial variability across scales.

artificial intelligence, machine learning, precipitation, (16 more...)

arXiv.org Artificial Intelligence

2511.03197

Country:

Oceania > New Zealand (0.04)
North America > Canada > Quebec > Montreal (0.04)
Europe (0.04)
(2 more...)

Genre:

Research Report (0.40)
Instructional Material (0.38)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.71)

Dynamic SBI: Round-free Sequential Simulation-Based Inference with Adaptive Datasets

Lyu, Huifang, Alvey, James, Montel, Noemi Anau, Pieroni, Mauro, Weniger, Christoph

arXiv.org Machine Learning | Oct-17-2025

Simulation-based inference (SBI) is emerging as a new statistical paradigm for addressing complex scientific inference problems. By leveraging the representational power of deep neural networks, SBI can extract the most informative simulation features for the parameters of interest. Sequential SBI methods extend this approach by iteratively steering the simulation process towards the most relevant regions of parameter space. This is typically implemented through an algorithmic structure, in which simulation and network training alternate over multiple rounds. This strategy is particularly well suited for high-precision inference in high-dimensional settings, which are commonplace in physics applications with growing data volumes and increasing model fidelity. Here, we introduce dynamic SBI, which implements the core ideas of sequential methods in a round-free, asynchronous, and highly parallelisable manner. At its core is an adaptive dataset that is iteratively transformed during inference to resemble the target observation. Simulation and training proceed in parallel: trained networks are used both to filter out simulations incompatible with the data and to propose new, more promising ones. Compared to round-based sequential methods, this asynchronous structure can significantly reduce simulation costs and training overhead. We demonstrate that dynamic SBI achieves significant improvements in simulation and training efficiency while maintaining inference performance. We further validate our framework on two challenging astrophysical inference tasks: characterising the stochastic gravitational wave background and analysing strong gravitational lensing systems. Overall, this work presents a flexible and efficient new paradigm for sequential SBI.

artificial intelligence, arxiv, machine learning, (17 more...)

arXiv.org Machine Learning

2510.13997

Country:

Asia > Turkmenistan > Ahal Region > Anau (0.05)
Europe > Netherlands > North Holland > Amsterdam (0.04)
Europe > Spain > Galicia > Madrid (0.04)
(6 more...)

Genre: Research Report (1.00)

Industry: Education (0.67)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

71883294314045d60c900113a359934b-Paper-Conference.pdf

Neural Information Processing Systems | Oct-10-2025, 05:53:25 GMT

dataset, modality, tokenizer, (14 more...)

Neural Information Processing Systems

Country:

Asia > Japan > Honshū > Chūbu > Ishikawa Prefecture > Kanazawa (0.04)
Oceania > New Zealand (0.04)
North America > United States > New York (0.04)
(5 more...)

Genre: Research Report > Experimental Study (0.93)

Industry:

Information Technology (0.67)
Transportation > Ground > Road (0.45)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
(3 more...)

36ce475705c1dc6c50a5956cedff3d01-Supplemental-Conference.pdf

Neural Information Processing Systems | Oct-8-2025, 11:02:20 GMT

approximation, artificial intelligence, machine learning, (18 more...)

Neural Information Processing Systems

Country:

North America > United States > New York > New York County > New York City (0.14)
North America > United States > California > Los Angeles County > Los Angeles (0.14)
North America > United States > California > Santa Clara County > Palo Alto (0.14)
(33 more...)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Mathematical & Statistical Methods (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Data Science (0.67)

Contrastive Neural Ratio Estimation

Neural Information Processing Systems | Oct-2-2025, 13:23:03 GMT

Notation details are defined in Section 2.1.

artificial intelligence, machine learning, natural language, (19 more...)

Neural Information Processing Systems

Country:

North America > United States (0.14)
Europe > Netherlands > North Holland > Amsterdam (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Asia > Turkmenistan > Ahal Region > Anau (0.04)

Genre: Research Report (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.95)
Information Technology > Artificial Intelligence > Natural Language (0.68)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.47)

ORBIT-2: Scaling Exascale Vision Foundation Models for Weather and Climate Downscaling

Wang, Xiao, Choi, Jong-Youl, Kurihaya, Takuya, Lyngaas, Isaac, Yoon, Hong-Jun, Xiao, Xi, Pugmire, David, Fan, Ming, Nafi, Nasik M., Tsaris, Aristeidis, Aji, Ashwin M., Hossain, Maliha, Wahib, Mohamed, Wang, Dali, Thornton, Peter, Balaprakash, Prasanna, Ashfaq, Moetasim, Lu, Dan

arXiv.org Artificial Intelligence | Sep-3-2025

Sparse observations and coarse-resolution climate models limit effective regional decision-making, underscoring the need for robust downscaling. However, existing AI methods struggle with generalization across variables and geographies and are constrained by the quadratic complexity of Vision Transformer (ViT) self-attention. We introduce ORBIT-2, a scalable foundation model for global, hyper-resolution climate downscaling. ORBIT-2 incorporates two key innovations: (1) Residual Slim ViT (Reslim), a lightweight architecture with residual learning and Bayesian regularization for efficient, robust prediction; and (2) TILES, a tile-wise sequence scaling algorithm that reduces self-attention complexity from quadratic to linear, enabling long-sequence processing and massive parallelism. ORBIT-2 scales to 10 billion parameters across 65,536 GPUs, achieving up to 4.1 exaFLOPS sustained throughput and 74–98% strong scaling efficiency. It supports downscaling to 0.9 km global resolution and processes sequences up to 4.2 billion tokens. On 7 km resolution benchmarks, ORBIT-2 achieves high accuracy with $R^2$ scores in the range of 0.98–0.99 against observational data.

machine learning, natural language, resolution, (17 more...)

arXiv.org Artificial Intelligence

2505.04802

Country:

North America > United States > District of Columbia > Washington (0.05)
North America > United States > Tennessee > Anderson County > Oak Ridge (0.04)
North America > United States > Oregon > Benton County > Corvallis (0.04)
(4 more...)

Genre: Research Report (0.82)

Industry:

Government > Regional Government > North America Government > United States Government (1.00)
Energy (1.00)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

Multidimensional Distributional Neural Network Output Demonstrated in Super-Resolution of Surface Wind Speed

Goldwyn, Harrison J., Krock, Mitchell, Rudi, Johann, Getter, Daniel, Bessac, Julie

arXiv.org Machine Learning | Aug-26-2025

Accurate quantification of uncertainty in neural network predictions remains a central challenge for scientific applications involving high-dimensional, correlated data. While existing methods capture either aleatoric or epistemic uncertainty, few offer closed-form, multidimensional distributions that preserve spatial correlation while remaining computationally tractable. In this work, we present a framework for training neural networks with a multidimensional Gaussian loss, generating closed-form predictive distributions over outputs with non-identically distributed and heteroscedastic structure. Our approach captures aleatoric uncertainty by iteratively estimating the means and covariance matrices, and is demonstrated on a super-resolution example. We leverage a Fourier representation of the covariance matrix to stabilize network training and preserve spatial correlation. We introduce a novel regularization strategy, referred to as information sharing, that interpolates between image-specific and global covariance estimates, enabling convergence of the super-resolution downscaling network trained on image-specific distributional loss functions. This framework allows for efficient sampling, explicit correlation modeling, and extensions to more complex distribution families, all without disrupting prediction performance. We demonstrate the method on a surface wind speed downscaling task and discuss its broader applicability to uncertainty-aware prediction in scientific models.
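
For readers unfamiliar with a multidimensional Gaussian loss, the sketch below evaluates the standard multivariate normal negative log-likelihood for one output vector using a Cholesky factor. It is a generic textbook formulation, not the paper's Fourier-parameterised covariance or its information-sharing regularizer; the function name `gaussian_nll` and the dense covariance input are assumptions of this example.

```python
import numpy as np

def gaussian_nll(y, mean, cov):
    """Negative log-likelihood of y under N(mean, cov) for a single sample.

    `cov` must be symmetric positive definite; a dense matrix is assumed here
    purely for illustration.
    """
    r = y - mean
    chol = np.linalg.cholesky(cov)            # cov = chol @ chol.T
    z = np.linalg.solve(chol, r)              # z @ z equals r^T cov^{-1} r
    logdet = 2.0 * np.sum(np.log(np.diag(chol)))
    k = y.shape[0]
    return 0.5 * (logdet + z @ z + k * np.log(2.0 * np.pi))

# Usage: correlated 2-D output with a small residual.
cov = np.array([[1.0, 0.5], [0.5, 2.0]])
loss = gaussian_nll(np.array([0.1, -0.2]), np.zeros(2), cov)
```

Unlike a per-pixel loss, the off-diagonal entries of `cov` enter both the log-determinant and the quadratic term, which is how correlation between outputs affects training.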

artificial intelligence, machine learning, neural network, (18 more...)

arXiv.org Machine Learning

2508.16686

Country: